SECTION 44 



Khmer Writing 



Eric Schiller 



The signary used in the Khmer language is an Indic-based script which dates back 
about 1500 years. It was used in inscriptions as far back as the sixth century and re- 
mains in use throughout Cambodia. The system is highly complex. Much of the com- 
plexity is due to its long history, since the phonology of the language has changed 
radically while the writing system has remained fairly constant. 

The writing system is alphasyllabic (see section 3 1 ) and written from left to 
right. The primary graphic element represents a consonant, with vowels indicated by 
symbols on either side of the consonant or hovering above or below. The consonant 
is written first, and then the vowel is added, even if the vowel sign is written to the left 
of the consonant. The space below the primary consonant is used for secondary con- 
sonants. Diacritics which affect the interpretation of the consonant appear both above 
and below the consonant, sometimes shifting position depending on the shape of the 
consonant. 

Modern Khmer writing uses a baseline approach, with the primary consonant sit- 
ting on the baseline; some letters have descenders below the line. The remnants of a 
historical "top line" are built into some letters, in the part known as the 'hair' . Khmer 
was originally carved in stone and written on palm leaves. 



The symbols 

Consonants 

There are 33 consonant symbols, presented in table 44.1 , organized by phonologi- 
cal characteristics representing their Indie origin. The symbols are arranged in seven 
rows, five of which reflect the point of articulation of the consonant, proceeding for- 
ward from the back of the mouth; the sixth and seventh rows are reserved for miscel- 
laneous items. One must keep in mind, however, that these phonological 
characteristics apply to an ancient form of the language, and that the presence of 
voiced aspirates at that stage is doubtful. Boxed letters are "series i," others are "se- 
ries 2," and this determines the pronunciation of vowels following them (see below). 
Methods of transliteration vary, especially with regard to the retroflexes and v/w. 
Transliteration is generally based on the historical/Indic values of the letters, and 
therefore it can differ greatly from transcriptions, which are based on the modern pho- 
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TABLE 44.1: Khmer Consonants 





Voiceless Plosives 


VOK CI) 


Plosives 


N 






Unaspirated Aspirated 


Unaspirated 


Aspirated 


ASALS 


Velar 


n 


k [k] 


2 


kh m 


f\ g [k] 


m gh M 


a 


A [1]] 


Palatal 





c [c] 


fi 


ch [&] 


ti J Ic| 


tm jh [ci 


ffl 


fi [p] 


Retroflex 


a 


t [cf] 


M 


th [t1 


8 d \d\ 


m dh m 


m 


n [n] 


Dental 


f\ 


t [t] 


ti 


th m 


% d It] 


a dh [tT 


s 


n [n] 


Bilabial 


U 


P [6] 


ti 


ph [pi 


n b ipi 


fl bh [pi 


H 


m [m] 




SONORANTS AND OTHERS 


Misc. voiced 


m y [j] t r w 


m 1 HI 


f V [w] 






Misc. voiceless 


fll 


s [s] 


Ul 


h [h] 


^J 


1 HI 


H ? [?] 







nology. There are alternative forms of the consonants used in consonant clusters, de- 
rived from these basic shapes, as shown in table 44.3 

Vowels 

The vowel system of Modem Khmer is very complex, and there is still no consensus 
on the number of phonemes involved. From the standpoint of the writing system, the 
greatest complication is that each vowel symbol can have two different sound values, 
depending on the consonant to which it is attached. There are 21 dependent vowel 
signs, which must be used in conjunction with a consonant; they are transliterated in 
TABLE 44.2 with their traditional names, which begin with fl ?. There are also some 
independent vowel signs, used when the vowel is not attached to any consonant. 

When no vowel symbol is used, an inherent vowel is assumed, which can have 
three different values. If the vowel follows a consonant of the first series, it is pro- 
nounced [di], while the second series value is [oi]. In Sanskrit and Pali loans, howev- 
er, [a] is often used. Sometimes words which consist only of a single consonant and 

the inherent vowel are written with the diacritic a.sJ^/ in order to avoid confusion 
with longer words, since IChmer writing does not place spaces between words. An ex- 
ample is fi [koi], a grammatical particle. The diacritic sahnoksanna - is applied in 
cases where the inherent vowel is shortened (and may be later diphthongized) , e.g., 
^1fi gat [ko9t] 'a pronoun, usually third person singular\ 
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TABLE 44.2: 


Khmer Vowel Signs 


and Transliterations 








Independent 


Diacritic 


Transliteration 


Independent 


D/tt^ 


riric 


Transliteration 


H 


- 


H 




?a 




1- 


m 


?e 


m 


-1 


HI 




?a 


a 


J) 


m 


?8 


fi 


- 


H 




?i 


n 


i- 


m 


?ay 


«] 


- 


» 




T\ 


§,6 


\A 


mi 


?o 




e£ 


ji 




?i 


i 


1 1 


mi 


?aw 




au 


5 




ri 




; 


u 


?um 


a 


I 


1 




?o 




._ 




n 


?om 


a 


V 


1/ 




?u 




^1 


HI 


?am 


i 


4 

V 

H 






?9W 

?5 




4 

-4 


HS 
} 

T 


?ah 

r 
1 




'-] 


«] 




?i9 


n 


41J 




1 




u 


«i) 




?i3 


n 


-nJ 


nJ 


1 



Clusters 

When a syllable begins with two consecutive consonants, the second consonant is 
written as a subscript to the first; in that case, a vowel that appears below the conso- 
nant is written underneath the subscript, e.g., Ut phcuar [phcua] 'to plow'. In addi- 
tion, geminate consonants in words borrowed from Sanskrit and Pali are often written 
as doubled consonants (see table 44.3), though there is no phonological gemination 
in Khmer: HtfinOtirifU Patekecchabokkal [?ateikecch9boku3ll 'fatahst'. 



Special cases 

There are a variety of special forms which apply to combinations of letters. Thus tp 
is used for the combination of [b] + [a:], to avoid confusion with tfl [ha:]. The letter 

in {h+ph) is an artificial constuct used for the foreign sound [f], e.g., filltfl [kaifei] 
'coffee'. 
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TABLE 44.3: Khmer Consonants with Subscript Forms" 



Velar 


n 


k 


2 

8 


kh 


P 


g 


mi 


gh 





n 


Palatal 


g 


c 


IJ 


ch 


g 


j 


m 


jh 


ni 

£3 


n 


Retroflex 


« 


t 


a 


th 


i 


d 


m 


dh 


fUl 

an 


n 


Dental 


n 


t 




th 


i 


d 


a 


dh 


s 


n 


Bilabial 


1 


P 




ph 


S 


b 


n 


bh 


IS 


m 


Misc. voiced 


mj 


y 


U 


r 


fy 


1 


1 

«7 


V 






Misc. voiceless 


wj 


s 


tn 

tA 


h 


!n 


1 


ft 


? 







a. The full forms of consonants are for identification only; most of the clusters represented here do not occur 
in the language. 



Correspondences 

The modem pronunciation of these sounds is quite far from those of Old Khmer, and 
varies significantly in regional dialects. As the pronunciation of the Phnom Penh di- 
alect is not very representative of Khmer as a whole, the Battambang dialect is used 
throughout this section. The sound system of Khmer has undergone several radical 
changes. A voicing contrast in initial consonants gave way to a contrast in phonation 
type of the following vowel. The voiced consonants lost their voicing but the follow- 
ing vowels retained a reflection of this voicing when they acquired a breathy phona- 
tion type ("register"). Eventually this breathy phonation type disappeared, leaving 
behind only an altered vowel quality. These changes are summarized in table 44.4, 
which also shows the written form of these Khmer sounds. Note that the same vowel 
symbol now represents two different sounds in most cases. 

It should be noted that details of both transliteration and transcription vary in or- 
thography. The transcription of diphthongs in particular varies widely. In fact, one 
rarely finds two scholars using the same approach. The most common systems are 
those described in Henderson 1952, Jacob 1968, Huffman 1970, and Diffloth 1992. 
The system used here is based on Diffloth's. 

The relationship betwen consonant series and vowel quality is made more elab- 
orate by the use of diacritics that "switch" series. For example, to combine m, in series 
2, with [ai], in series i, one adds a diacritic, trdysap -; compare H1 md [mio] *uncle' 
and Hin mdk [maik] 'mark'. (For series i consonants, a mUsekdtodn - is used.) The 
situation is further complicated by the fact that vowels and elements of the consonant 
sometimes occupy the area above the main consonant. In the word W [sii] 'to eat' the 
vowel [ii] is written on top of the s, so that there is no room for a trsysap. In such cas- 
es, a different diacritic is used, the kbiek kraom, which is identical to the vowel w, un- 
derneath the consonant symbol. 



TABLE 44.4: Khmer Vowel Series 





First Series 






Second Series 




Written 


Transliteration 


Transcription 


Written 


Transliteration 


Transcription 


n 


ka 


[kDi] 


n 


ga 


[ko:] 


m 


ka 


[kai] 


ni 


m 


[kia] 


h 


ki 


[ke] 


n 


gi 


[ki] 


n 


ki 


[k9i] 


n 


g"! 


[ki:] 


n 


ki 


[k9] 


n 


g* 


[ki] 


n 


ki 


[koi] 


n 


g* 


[ki:] 


1 


ku 


[ko] 


n 


gu 


[ku] 


V 


ku 


[kou] 


P 


gu 


[ku:] 





kU9 


[kuo] 


D 


gU3 


[ku3] 


in 


k5 


[kaQ] 


in 


g^ 


[ka:] 


iff 

u 


kia 


[kio] 


A 


gt^ 


[kia] 




ki9 


[ki9] 


in 

u 


gii^ 


[kia] 


in 


ke 


[kei] 


in 


ke 


[ke;] 


in 


k£ 


[kae] 


in 


ke 


[ke:] 


in 


kay 


[kay] 


in 


gay 


[kiy] 


mi 


ko 


[kao] 


ini 


go 


[ko:] 


inl 


kaw 


[kau] 


inl 


gaw 


[kiw] 


n 


kum 


[kom] 


\ 


gum 


[kum] 


n 


ksm 


[kDm] 


n 


gcim 


[kum] 


ni 


kam 


[kam] 


ni 


gam 


[koam] 


ns 


kah 


[kah] 


n^ 


gah 


[kea„h] 



In addition, Indie loanwords obey a different set of pronunciation guidelines, 
which are not consistently followed. A detailed discussion is presented in Jacob 1968: 
47-51. For example, the initial consonant transliterated as h 11 is pronounced [p] in 
such loans as U1l1f5n bdrdsit [pairaisst] 'parasite', but in Indie words which have be- 
come assimilated to the Khmer vocabulary it is pronounced [6], for example tlfUlJ 
bony [bon] 'reUgious festival' (cf. Sanskrit punya 'merit'). Highly important reh- 
gious terms are not immune to naturalization, so that the most sacred terms have been 
absorbed: 1)9 buddh [put] 'Buddha', a§ dharm [too] 'Dhanna, law', WfiJ sangh 



► PART VII: SOUTHEAST ASIAN WRITING SYSTEMS 
TABLE 44.5: Khmer Numerals 



I 


2 


3 


4 


5 


6 


7 


8 


9 





9 


10 


m 


a 


a 


a 


ni 


d 


g 






[sDq] 'Sangha, priesthood'. The word dharm also illustrates the symbol - r9bdt, used 
to write preconsonantal r in Indie loanwords. 

Note that many final consonants are not pronounced. A sign used to indicate un- 
pronounced elements of some loanwords is the tomddkhiet 'kill sign' -, e.g. 
fSfifiUfUl sitdkdrn [sotskai] 'smile'. 

The lack of consistency with regard to matters of transcription and transliteration 

is even evident in the name of the country where most Khmer speakers reside. fiHtil 
is, in transliteration, kambujd, but the modern pronunciation is [kampucis]. This is 
reflected in the official name of the country, which was Kampuchea until quite recent- 
ly, when the older form (Cambodia) has made a comeback due to political changes. 
The Khmer spelling has remained constant, however. 

Punctuation and numerals 

Words are not separated in Khmer script. A marker 1 khan is used to indicate change 
of topic and comes at the end of what amounts to a group of sentences, though often 
less than a paragraph. A special sign lO to indicates that the previous word or group 
of words is repeated. This is a common sign, because Khmer morphology applies re- 
dupHcation for a variety of grammatical purposes. + 

Some colloquial words are written with a kdkhdt - on top, e.g., U^^ [cah] 're- 
sponse particle used in women's speech' . 

The numerals are shown in table 44.5. 

Sample of Khmer 

tnmisiommfitTis|SimHmitfimsatris8sjimma§RH^ 



/. Khmer: 




mm 


tsio mil m 01 


|[,^ ]t\ m 


V 


2. Transliteration: 


pel 


nuh gat rut ban trl rah 


musy 


3. Transcription: 


peil 


nuh koot rut 6a :n trsi mh 


muoy 


4. Gloss: 




time 


that PRON fish. with. basket trei-rah 


one 


I. itnm 


SU 


tns 


gsjim mn 


9n m 




2. h5y 


nir) 


ban 


dansay larj 


dik musy 




3. hasj 


nor) 


Bam 


tosnsaij lost] 


tik mu9j 




4. already 


then 


get 


rabbit sink 


water one 
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'At that time, he fished out a trei-rah (kind of fish) and then got a rabbit which 
was drowning in the water.' — From a Khmer folktale. 
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